SCALING LIMITS OF THE UNIFORM SPANNING TREE 
AND LOOP-ERASED RANDOM WALK ON FINITE 

GRAPHS 

YUVAL PERES*, DAVID REVELLE 1 " 

Abstract. Let x and y be chosen uniformly in a graph G. We find 
the limiting distribution of the length of a loop-erased random walk 
from x to y on a large class of graphs that include the torus for 
d > 5. Moreover, on this family of graphs we show that a suitably 
normalized finite-dimensional scaling limit of the uniform spanning tree 
is a Brownian continuum random tree. 



1. Introduction 

The uniform spanning tree (UST) T on a graph G is a random tree, 
uniformly distributed among all spanning trees of G. For two points x and 
y in a graph, let d-r[x, y) be the distance from x to y in T. For the complete 
graph K m , the distance dr(x, y) is on the order of y/m and the distribution 
satisfies 

F[d r (x,y) > AvH = exp [-A 2 /2] + o(l). 

Moreover, rescaling by dividing edge lengths by \fm results in a scaling limit 
for the UST on K m that is the Brownian continuum random tree (CRT) 
(We will recall the construction of the Brownian CRT in fll.ll ) Pitman 
|16j conjectured that the Brownian CRT should also be the scaling limit of 
the UST in certain other graphs, and in particular should be the scaling 
limit of the UST on the d-dimensional discrete torus for large d. This 
conjecture is supported by a recent result of Benjamini and Kozma 
who showed that the expected distance of loop- erased random walk (LERW) 
between two uniformly chosen points on Z^ is on the order of n d l 2 for d > 5. 
Pemantle ^5] proved that for any x and y, dq-(x,y) is equal in distribution 
to the length of a loop-erased random walk from x to y, so Benjamini and 
Kozma's result also gives some information about T. In Theorem 11.11 we 
confirm that Pitman's conjecture holds for d > 5. 

Theorem 1.1. Let x and y be independently and uniformly chosen in 
and let d-r(x, y) denote the distance between x and y in the UST. For d > 5, 
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there exists a constant /3(d) > such that 



(1) 



lim F[d r (x,y) > f3(d)Xn d/2 \ = exp 



A 2 1 



n 



oo 



2 



Moreover, if {x±, . . . ,Xk} are uniformly chosen from Zj[, then as n tends to 
infinity, the joint distribution of 



converges in distribution to F^, the joint distribution of distances between 
the first k points of the Poisson line breaking construction of the Brownian 



In Section El we express (3(d) in the form j(d)/ya(d), where 7(d) and 
a(d) are probabilities of events involving random walk on Z, d with a(d) — > 1 
and 7(d) — ► 1 as d — ► 00. 

Using Pemantle's ^Hl result that dr(x,y) is equal in distribution to the 
length of a loop-erased random walk from x to y, Q gives the limiting 
distribution of the length of LERW from x to y in Z^. In Subsection 11.31 
we will recall Wilson's algorithm for constructing the UST, which gives even 
stronger connections between the UST and LERW. 

1.1. Construction of the CRT and We define the distribution F}- 
referred to in Theorem ll.il via the Poisson line breaking construction of the 
Brownian CRT. Let si,S2,... be the arrival times for an inhomogeneous 
Poisson process whose arrival rate at time t is t. Draw an initial segment 
of length s\, and label its ends y\ and 7/2- Pick a point uniformly on this 
segment, attach a new segment of length S2 — s\, and label the end of this 
segment 2/3. To continue inductively, given a tree with k ends y%, y2, ■ ■ ■ , y& 
and total edge length s&_i, pick a point uniformly on the tree and attach to 
this point a segment of length Sk — Sfc-i, and label the new end yk+i- We 
let F}. denote the joint distribution of the distances between the k points 
yi, . . . , yf.. Note that F2 is simply the distribution of s%, which is given by 



As we are only interested in fe-point distributions of the UST on graphs, 
we can stop our construction at time s^_i. More generally, as t tends to in- 
finity, the resulting sequence of trees, viewed as a sequence of metric spaces, 
converges to a random, compact metric space. The limit is known as the 
Brownian CRT. As shown in 1 , for y%, . . . y y^ £ K m , the joint distribution 
of dq-(yi, yj)/^/rn converges to F^ as m — » 00. Our main arguments involve 
coupling LERW on G n with LERW on K m , and we will obtain the results 
about the scaling limit of UST on G n via this coupling. 

For a further discussion of the Brownian CRT, see the original papers of 
Aldous ^ 12 Ej or the lecture notes of Pitman \Tf\ . 



dr(xj,Xj) 
[3(d)n d / 2 



CRT. 



W[d(y 1 ,y 2 ) > A] = exp - — 
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1.2. Other Graphs. The limiting behavior in Q is not universal. On a 
cycle, for instance, the typical distance is on the order of \G n \ rather than 
jGnj 1 / 2 . The case of Z 2 is quite hard, but recently Kenyon El showed 
that the typical distance of LERW is on the order of |G n | 5//8 , and Schramm 
|19j and Lawler, Schramm, and Werner ^2] have studied the scaling limit 
of the UST. 

There are also examples of graphs that are not vertex transitive, such as 
a star, in which the typical distance can be much less than IG^I 1 / 2 . Never- 
theless, our methods also apply to a broader class of graphs, including the 
hypercubes TH^ and expander graphs. As a generalization of Theorem 11.11 
we will give a set of three conditions such that whenever all three hold, a 
suitable scaling limit is again the Brownian CRT. The three assumptions 
that we will make are vertex transitivity, a bounded number of local inter- 
sections of two independent random walks, and relatively fast mixing of the 
random walk. 

More formally, let p t (x) denote the distribution of simple random walk 
at time t started at a basepoint o € G n . Our first assumption is that there 
exists a constant such that 

|G„|V2 

(2) sup sup ^ (t + l)p\x) <9<oo. 

On Z d , p t (0) decays like t~ d / 2 , so for d > 5, a condition similar to © holds 
in that S^otp*(0) is bounded. The implication on Z d is that two random 
walks with the same starting point intersect each other finitely often (see 
e -g-i |H| - Theorem 3.5.1). Lemma 14.11 will show that © is an analog for 
finite graphs that says that two independent random walks starting at the 
same point only intersect a bounded number of times in the first (G^ 1 / 2 
steps. 

Let 7r = 7T n (-) denote the stationary distribution of the walk on G n . De- 
note the (uniform) mixing time by 

p t (x) 



T n = inf \ t : sup 

x&G n 



1 

< - 

~ 2 



7r(x) 

Our second assumption is that for some 5 > 0, 
(3) r n = o(|G n | 1 / 2 -'). 

Note that this mixing time assumption implicitly requires that the walk be 
aperiodic, as otherwise no such r exists. As adding a holding probability of 
1/2 to a random walk does not affect LERW, this aperiodicity assumption 
does not affect the final LERW or the UST. We will show in Sections |H1 
and IH1 that examples of graphs satisfying (j2j) and (j3J) include the tori 
for d > 5, the hypercube T^, expanders, and the complete graph K n . For 
vertex transitive graphs, conditions (J2J and (j3J) are sufficient to generalize 

o 
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Theorem 1.2. Suppose that {G n } is a sequence of vertex transitive graphs 
satisfying an d © smc/i i/iat |G n | — ► oo. T/ien i/iere exists a sequence 
of constants {(3 n } with 

< inf /3 n < sup P n < oo 
such that for x and y uniformly chosen from G n , 



(4) lim ¥[d T (x,y) > /3„A|G n | 1//2 ] = exp 



n— >oo 



A? 
2 



Moreover, ifk points {x±, . . . , x^} are chosen independently and uniformly 
from G n , then the joint distribution of 



(5) 

converges to as n —* oo. 



d r (xj,Xj) 



Note that the difference between Theorem 1 1 . 1 1 and Theorem ll.2l is that we 
potentially need a bounded sequence j3 n , rather than having a single scaling 
constant (3. If we strengthen assumption (|2"|). then the factor of /3 n in the 
rescaling turns out to be identically 1. More precisely, (J2J) says that the 
number of intersections of two walks run for |G n | 1//2 steps is bounded. To 
strengthen that, let {X{\ and {Y u } be independent random walks on G n , 
take q = (rlG^I 1 / 2 ) 1 / 2 , and suppose that 

(6) supE[|{X^ =0 n {Y u } q u=0 \ \X = x,Y = y}^0 

as n — ► oo. Note that condition © does not hold if the graphs {G n } have 
uniformly bounded degree as then the probability that two walks started 
at the same point will intersect after 1 step is bounded away from 0. It 
does hold, however, for the hypercubes or the complete graphs. The local 
intersections are what caused us to require that the points {xi} be chosen 
uniformly, and that (3 n ^ 1. Thus assumption © yields the following, 
stronger result. 

Theorem 1.3. Let {G n } be a sequence of vertex transitive graphs such that 
{3p and Jj^ hold. Then for any choices of k distinct vertices {x^"\ . . . ,x^} 
in G n , the joint distributions of 

(7) d T {xt\xf) 



\G |V2 

converge to F^ when n — > oo. 

In Section fl .31 we define LERW and recall Wilson's algorithm. In Section 
|2]we give an outline of the proof of Theorem ll.il explain the significance of 
the size IGjij 1 / 2 , and discuss the structure of the rest of the paper. 
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1.3. Loop-erased random walks and Wilson's algorithm. Here and 
throughout this paper, we will use (•) to denote sequences when order is 
important, and {•} to denote sets. 

Given a finite path 7 = (uq,u±, . . . in a graph G, let LE(7) denote 
the loop- erasure of 7 with loops erased in chronological order. Formally, 
LE(7) is the sequence (vo,v\, . . .) constructed recursively as follows: first, 
let vq = uq; then, given v r , let k be the last time that Uk = v r , and let 
tV+i = Uk+ii with the convention that if k = £, then v r is the last term of 
LE(7). In the case when 7 is the path of a random walk that starts at x 
and is stopped when it reaches y, we call LE(7) a loop-erased random walk 
(LERW) from x to y. More generally, if 7 is a random walk from x stopped 
when it hits a set S, then we say LE(7) is LERW from x to S. 

For an unweighted graph G, the UST on G is connected to LERW on G 
by Wilson's algorithm for constructing a UST: pick a root vertex p, and form 
an initial subtree T\ by picking another vertex x\ and running LERW from 
x\ to p. We then proceed recursively as follows: given a subtree %, pick a 
vertex Xi+\ and run LERW from Xi+\ to %. Let %+i be the union of % 
with this new path. Proceeding until % is a spanning tree yields a UST 20 . 
Taking p = y and x% = x yields Pemantle's result ^5] that the distribution 
of the path from x to y within the UST is the same as the distribution of 
LERW from x to y. We will use the fact that Wilson's algorithm is very 
robust-in particular, the sequence {xi} may be chosen arbitrarily, and it 
also applies to weighted graphs. To apply Wilson's algorithm to weighted 
graphs, we let LERW denote the loop-erasure of a weighted random walk 
that moves according to the weights on the graphs. The probability of any 
given spanning tree resulting from Wilson's algorithm then is proportional 
to the product of the weights of the edges of the tree, so Wilson's algorithm 
yields a weighted UST [20]. 

As mentioned earlier, the limiting distribution of dq-(x, y) on the complete 
graph K m is given by 

¥[dr(x,y) > \y/m\ = exp [-A 2 /2] + o(l). 

Because there is no geometry on the complete graph, there are a number 
of derivations of this limit (c.f., [0]). One such argument that uses the con- 
nection between the UST and the LERW is as follows: Let 7 = (27, . . . , xg) 
denote the loop-erased path from x to y, so x\ = x and xg = y. Conditioned 
on 7, = (xi, . . . , x^ , we want to compute IPfxj+i = z]. Considering the 
walk after the last visit to 7$, we want to condition on never returning to 
7j. Let f(z) be the probability that the next step is to z and the walk then 
reaches y before hitting 73. We have f(y) = l/(m — 1), and by symmetry, 
f(z) = l/[(m — l)(i + 1)] for z G K m \ {7^ Uy}. As / = on 7$, conditioning 
on the union of these events occurring show that the probability of stepping 
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to y at step (i + 1) is thus (i + l)/m. In particular 

rjfc-i 



r(x,y) = k] = — — 
m 



???, 



=1 

Rescaling by m -1 / 2 and letting m — ► oo, this distribution converges to the 
Rayleigh distribution. Generalizing this argument for graphs with more 
structure is difficult, so we will use other techniques. 

2. Outline of Proof 

To prove Theorem ll.2l we would like to use Wilson's algorithm and make 
a comparison with the Poisson line breaking construction. Unfortunately, if 
x and y are uniformly chosen from a graph G n that satisfies the hypotheses 
of Theorem II. 21 it turns out that the typical random walk from x to y takes 
on the order of \G n \ steps. In the special case of G n = Z^, Benjamini and 
Kozma showed that the typical length of the loop-erased path is only of 
the order of |G n | 1//2 , meaning that almost all of the original path is erased. 
Theorem 11.21 implies that this problem occurs more generally, so to avoid it 
we would prefer to only run a random walk for the order of |G n | 1//2 steps. 

To do this, for a given L, consider the extension G* L of G n formed by 
adding a vertex p, with an edge from every vertex of G n to p of weight such 
that a weighted random walk on G* L is simple random walk on G n modified 

to move to p after a geometric number of steps with mean L\G n \ l l 2 . We 
will run Wilson's algorithm on G* L with root vertex p. This results in a 
spanning tree T* on G* L , rather than the uniform spanning tree T on G n . 

The tree T* induces a spanning forest T on G n formed by restricting T* 
to G n . Let dr{x,y) and d^(x,y) denote the distance between x and y in T 
and T respectively, with the convention that d^-(x,y) = oo if x and y are 

in different components of T . This spanning forest T is comparable in the 
following sense to the UST T: 

Lemma 2.1. Let {G n } be a sequence of vertex transitive graphs such that pj[) 
and |0} hold with constants and 5 respectively, and such that \G n \ — ► oo. 
Pick k points {x\, . . . ,Xk} uniformly and independently from G n . For any 
e > 0, there exist constants Lq and N such that if L > Lq and n> N , then 
the total variation distance between the joint distribution of the distances 
dq-(xi,Xj) and the joint distribution of the distances d^(xi,Xj) is less than 

We will prove Lemma 12.11 in Section [Tj The point of Lemma 12.11 is that 
it now suffices to understand T rather than T. To do so, we will use a 
comparison with an extension of the complete graph K m . Let K m denote the 
complete graph with a self-loop at every vertex, and construct an extension 
K* ~ of K m in the same way that G* T was constructed, meaning that K* ~ 

m,L m J n,L ' 6 mL 

is formed by adding a vertex p that is connected to every vertex of K m with 



SCALING LIMITS OF UST 



7 



an edge of weight m/(LyJm — 1) and so weighted random walk on K* ~ is 
simple random walk on K m that jumps to p after a geometric number of 
steps with mean Ly/m. 

On G n , pick {x\, . . . , x^} uniformly and independently. Running Wilson's 
algorithm on G* n L with root vertex p, let Tq = {p}. Given Tg, let 7£ +1 be 
formed by running LERW from x^ + \ to 7^. Note that T\ is LERW on G* nL 

from x\ to p. We call 7^ the £-th partial spanning tree. Likewise, let 7^ 
denote the ^-th partial spanning tree on K* ~. We want to select m and L 

in such a way to couple 7^ and 7^. To couple both hitting probabilities and 
length, we need to consider the following definition of capacity. 

Definition 1. Let {Xt} be simple random walk on G n . For S C G n , let 
Ts = min{i > : Xt € S} denote the hitting time of S. The r-capacity of S 
is given by 

Cap r (5)=P w [r 5 <r]. 
By considering the expected number of visits to S in the interval [0, r), 

(8) Cap r (5) < rvr(S) = ^\ 

l^nl 

for any set S. When S is a segment of a random walk, we will show in 
Lemma 15. Ml that the bound in (jBJ) is, with high probability, sharp up to 
constants. 

Definition 2. Let {xi, . . . , xt} and {y\, . . . , y^} be uniformly and indepen- 
dently chosen from G n and K m respectively. For any C, L, L, a, (3, 5, and 
r, the partial spanning trees 7& and 7^ on G* T and K* ~ are said to be 

n ' lj m,L 

successfully coupled if the following three conditions hold: 



(9) 

for all i,j < k, 
(10) 



Cap r (T fc )|G n |V2 



1711 



a 



1/2, 



< C|G, 



-5/32 



dr k {xi,Xj) d Tk (y i ,y j/ 



/3|G n |V2 



m 



< C\G n \~ s/32 



Let p be the uniform measure on 7& and v the hitting measure, meaning that 
if {Xt} is simple random walk on G n and T = inf{t > : Xt £ T~k}, then 
v{x) = P[Xy = x]. There is a coupling of p and v such that if £ is chosen 
according to p and rj according to v, then 

(11) n<hj,(Z,v) < C\G n \ l l 2 - s l™} > 1 - C(\G n \- s / 32 ). 

Lemma 2.2. Let {G n } be a sequence of vertex transitive graphs with \G n \ — ► 
oo such that and hold with constants 8 and 5 respectively. Then 
there are sequences of constants a = a(n), (3 = (3{n) and r = r(n), and 
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for any L and e > 0, there are sequences L = L(L,n) and m = m(n), and 
a constant C = C{k) such that if {x%, . . . , Xk} and {y%, . . . , yt} are chosen 
independently without replacement from G n and K m respectively, then for 
any n > N, there is a coupling ofT^ andT^ that is successful with probability 
1 — C(k) (|G n | -5 / 32 ). Moreover, m and L can be made arbitrarily large by 
first taking L and then n sufficiently large. 

Theorem 11.21 is then a consequence of IjlOjl , along with Lemma 12.11 and 
Aldous' theorem that the Brownian CRT is the scaling limit of UST on K m . 
Although only 1)1(7 [) is needed to imply Theorem 11.21 we add conditions © 
and (jllj) to Lemma 12.21 because the proof will be by induction, and the 
inductive step requires those two pieces. 

The key to the proof is a rescaling argument that makes use of the facts 
that loops formed by a typical random walk are either very short or very 
long. More formally, let {X{\ be a simple random walk on G n run for 
L\G n \ 1 / 2 steps. Call a loop short if it is of length at most r, and long if it is 
of length at least IG^ 1 ^ 6 . Ht-u>r, then F[X t = X u ] < 2/\G n \, so the 
expected number of loops of length between r and |G' ra | 1 / 2_<5 formed by time 
L\G n \ 1 / 2 is bounded by 2L\G n \~ 5 . In particular, the probability of having 
loops of this type of intermediate length tends to 0. 

In Section |3] we will describe a rescaling argument that takes advantage 
of the absence of intermediate length loops. In order to make this rescaling 
argument work, we will show in Section |1] that short pieces of a loop-erased 
random walk retain a positive proportion of their length. Section [21 then 
combines these pieces of the walk to show that longer lengths of loop-erased 
walk have a behavior that is tightly concentrated around the mean behavior. 
Section is then devoted to using these pieces to prove Lemma 12.21 In 
Section [7J we prove Lemma 12.11 thus concluding the proof of Theorem 11.21 
In Section |H] we then prove Theorem ll.il and in Section |U] we prove Theorem 
PI 

3. Key definitions and introduction of rescaling 

As the inductive step in the proof of Lemma 12.21 requires running LERW 
from Xfc+i to 7fc and showing that this new length is similar to what occurs 

on K* ~ , we begin by considering what happens on K* ~ . 

m,V ° ^ m,L 

Suppose that S C K* ~ is a subset such that p £ S. Pick a point y £ K* ~ 

772 , Li Tfh , ±j 

with y ^ p, and let {Y u \ denote weighted random walk on K* ~ started at 

m,L 

y and stopped at time 

T = min{u > : Y u G S}. 

We wish to understand LE(Y U )„ =0 , and in particular its length. To do this, 
let 

(12) lij = 
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be a collection of indicator random variables that keep track of intersections. 
One special property of the complete graph is that, for fixed j, the joint 
distribution of Ijj as i varies conditioned on the values of Iki for k,£ < j 
is the same as the joint distribution conditioned on {Y u } u< j. Since we are 
only interested in the length of LERW from y to S, the lack of geometry on 
K m means that we do not lose information by only keeping track of the time 
indices instead of locations of {1^}. Moreover, keeping track of times instead 
of locations will generalize more appropriately for our rescaling argument 
when we consider LERW on G* n L . 

We now inductively construct a family of sequences {Sj} that record 
what time indices have survived loop-erasure up to time j. Let our initial 
sequence of length one be given by Sq = (0). Letting * denote concatenation 
of sequences, for < j < T let 

(13) Sj = (k€ Sj-i : lij = 0Vi<k,ie Sj-i) * {j}. 

The result of this definition is that Sj consists of the original time indices 
of the walk that have survived loop-erasure at time j, with the convention 
that when a loop is formed, the original time index is removed and the new 
time is retained. One consequence of this is that for j < T, 

LB(Y u )i =0 = (Y u ) ue§ .. 

For the random walk on G* L , we will again define a collection of indicator 
random variables {lij} and index sets {Sj}, but the difference now is that the 
indices will represent a moderately long segment of the loop-erased random 
walk instead of individual points. 

In the rest of this paper, we will be using a variety of different time scales; 
for ease of reference we give here a summary of the meanings of the different 
scales on which we will be working. First, weighted random walk on the 
extension G* n L started in G n moves to p after a geometric number of steps 

that is on the order of IGnl 1 / 2 . Recall that r = r n denotes the mixing time 
for random walk on G n . We will break up |G n | 1//2 into shorter segments 
Ai of length that is asymptotically r = LT 1//4 |G n | 3 / 8 J . By assumption ©, 
r, and thus r, is of a lower order than |G n | 1//2 . To show that the behavior 
of the walk on each run of length r is close to its mean behavior, we will 
further break these runs into smaller pieces of length q = Lr 1/2 |G„| 1/4 J 
and then sum the pieces to get large deviation estimates. These estimates 
require independence between the segments, so instead of considering the 
loop-erased path, we will consider a local loop-erasure, with a window size 
s = Lr 3 / 4 |G n | 1//8 J that is much smaller than q. To justify that the restriction 
to local loop-erasure does not throw away too much information, we will then 
finally show that there are a number of local cutpoints, for which we will 
only look at path segments whose length is on the order of r. As a summary, 
see Tabled 
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typical length of LERW 


r= L T l/4| Gn |3/8j 


length of Ai 


q=[r^\G n \^\ 


subdivisions of Ai 


s = Lr^lGnlVSj 


window for local loop-erasure 


T = T n 


mixing time, and window for local cutpoints 



Table 1. Definitions of scales 



To remember these relative sizes, note that {r, s, q, r, jGnl 1 / 2 } is approxi- 
mately a geometric sequence with common ratio r _1 / 4 |G n | 1 / 8 . 

As mentioned above, in order to get the independence between various 
pieces that we need for our large deviation estimates, we will need to use a 
local loop-erasure instead of the original loop-erasure. 

Definition 3. Let {X{\ denote a weighted random walk on G* L with Xq ^ p 
that is stopped at time T . A time u is locally retained if 

LE<x^ =max{0i _ } n {x t} rsr s} = 0- 

Definition 4. For a stopping time T , let U be the set of all times t <T that 
are locally retained. The local loop-erasure h'E s (X t }J = Q is the subsequence 
of (X t }f =0 such that 

~L^>s(Xt)J =0 := {X u ) ue u- 

When the stopping time T is not specified, we will take T = oo in the 
definition of the local loop-erasure. 

It is not a priori true that either the local loop-erasure or the loop-erasure 
contains the other. For example, if the original path has a loop of length 
s — 1, the local loop-erasure could have a jump, while if the original path 
has a loop of length just longer than s, then the local loop-erasure can have 
short loops. These differences raise a problem that we will have to deal with 
later. We will later formalize the notion that, with high probability, the local 
loop-erasure has no jumps, and that the main difference between the local 
loop-erasure and the loop-erasure comes from having long loops (meaning 
of length greater than r). Our coupling with LERW on the complete graph 
will keep track of the long loops. 

Given a set S C G* nL with p G S, pick a point x £ G n , and let {X t } 
be weighted random walk on G* n L with Xq = x and run until time T = 
mm{t > : X t G S}. 

For i < T/r, let 

(14) Ai = Ai(r, s) = Zn[(j- l)r + 2s + 1, ir - s]. 

As s = o(r), the number of elements of Ai is asymptotically r. Adding a 
buffer of length s at the beginning and end of Ai means that the times that 
are locally retained within the different Ai are independent. The second 
delay of s at the start of Ai will also mean that the locations of the path on 
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different Ai are close to independent. We will let LE s (Aj) denote the part 
of the local loop-erasure of {Xt} whose original times were in Ai, that is to 
say 

LE s (Aj) := (X t )teAinu- 

Since p £ S, the hitting time T is on the order of at most IGVJ 1 / 2 , so the 
number of intervals Ai is at most on the order of \G n \ l l 2 /r. We now wish 
to keep track of non-local loops, meaning loops that somehow involve two 
of the Ai. For i < j, let Ij,- denote indicator random variables 



(15) I 



1 {LE s (Aj) n {X t } teAj + 0}, 
{LE a (Ai) n {X t } teAj = 0}. 



Unlike the case of the complete graph, here the joint distribution of Iy 
conditioned on {Xt}t< r j is different from the joint distribution of Iy con- 
ditioned on {Ifcg, k, £ < j}. Despite this, we can still recursively construct 
a family of sequences Sj that in some sense records which runs LE s (j4j) 
survive loop-erasure. The construction will implicitly take the entire path 
{Xt} into consideration. To begin, let So = (0). For < j < \T/r~\, let * 
denote concatenation of sequences, and let 

(16) Sj = (ke Sj-i : lij = 0Vi<k,ie Sj-i) * {j}. 

These sequences are intended to play much the same role as {Sj}, and 
can be thought of as keeping track of indices i such that LE s (Aj) is com- 
pletely contained inside LE(X^)^ =0 . There are some problems: if LE s (^4j) 
is involved in a long loop, then part of it is erased. In particular, it is not 
true that LE s (^4j) C LE(X^)^ =0 , and also if the long loop involves LE S (A,), 
then only one of i or j is retained in Sj. There is also a problem when one 
end of a long loop is in the gap of length 3s between the A4. We will later 
prove that these differences are sufficiently rare that their contribution is of 
a lower order of magnitude than the length of LERW. 

In Section El we will show that the length of LE s (j4j) is tightly concen- 
trated about its mean jr for some 7 = 7(71) that is bounded away from 
0. This implies that \L'E(Xt)f =0 \ is roughly 7r|«S|7y r j|. We will prove the 
induction step of Lemma 12.21 in Section El but to understand the idea, take 
S = Tu. We want to pick an m that lets us couple LERW on G* r and K* ~ 

"■'-^ m,L 

in such a way that \Syx /rj I = \Sf\i where T is the hitting time for 7^ C K* ~. 
To do this, we need the distribution of Lj to be close to the distribution of 
lij. Let Cap r (S') denote the capacity of S as defined in Definition ^ 
Define 7 = 7(n) and a = a(n) by 

(17) 7 = E |LE ' (A)I , a = ECa Pr [LE s (A t )]^. 



In Lemma |5.3| we will show that both |LE s (^4j)| and Cap r [LE s (^4j)] are 
tightly concentrated about their means. The capacity bound implies that 
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Ely is approximately ar 2 /\G n \. On K m , Ely = (1 + o(l))/m, so in our 
rescaling we will take the size of the complete graph to be 

\G n 



m 



ar^ 



and let 



7 



a 



1/2' 



For the special case of S = Tq = {p}, taking T to be the hitting time of p 

lg that 

X\G n \^' 



on K* ~, the two point distribution reduces to showing that 

m,L 



|LE(X t )f =0 | > -^X\G n \^ 2 

OL I 



\S 



LT/rJ 



> 



Q/l/2 r 
iS™ > Aa/tTI + o(l) 



which tends to exp[— A 2 /2] &s m,L —> oo. 

To obtain (|T|). in Section |H] we will show that, on the torus, the limits of 
j(n) and a(n) exist. Computing these limits and replacing (3 n by the limit 
of (5 n will then prove Theorem ll.il 

In the next few sections, we will develop the tools needed to prove Lemma 

4. Positive length of small pieces 

The aim of this section is to study local-loop erasure of runs of length 
roughly q « r 1 / 2 |G n | 1//4 and show that LE s (X t )j~2s+i retains a positive 
proportion of the original walk. Note that the buffers of length s and 2s at 
the start and end of these pieces of length q are the same size as in A{ and 
serve the same role. The fact that erasing loops shortens the path, along 
with ((HJ), gives the upper bounds 



E 



2.s 



ECap r [LE s (X, 



q—s ■ 
t/t=2s. 



\Gn\ 

qr 



< 1, 



< 1, 



where Cap r (S') is as in Definition ^ The focus of this section will be giving 
lower bounds for these quantities, and in particular showing that they are 
bounded away from 0. 

For random walks on Z d , the fact that ^ kF{Xk = 0] < oo for d > 5 
is equivalent to the fact that two simple random walks on 7L d will intersect 
each other finitely often in dimensions 5 and higher. The next lemma makes 
more precise the fact that condition (J2J) gives a local analog. 

Lemma 4.1. Suppose that {G n } is a sequence of vertex transitive graphs 
satisfying with constant 9. Let {X t } and {Y t } be independent random 
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walks started at the same point o. Let {Xt} and {Yt} be the walks {X{\ and 
{Yt} killed at random times Tx andTy, which are geometrically distributed 
random variables with mean (1 — A) -1 . Letting 8 be the constant from 

(is) n{x t }t>o n {Y t }t>i = 0) > ( e + 



\G n \(l-Xf 

Proof. We use the central idea of the proof of Proposition 3.2.2 in Call 
a pair of times a *-last intersection if 

{(t,u) :X t = Y u ,t>t,u> j} = {{i,j)}. 

Abbreviate 

/(A) =F({I ( } i > n{y t } t > 1 = 0). 

Note that F[X { = Yj] = X i+j F[Xi = Yj], so by vertex transitivity and the 
memoryless property of exponential random variables, the probability that 
(i,j) is a *-last intersection is bounded by 

\ i +iF[X i = Y j ]f(\). 

Because the killed paths are finite in length, there is at least one Mast 
intersection. By symmetry of the walk, P[Xj = Yj] = ¥[Xi + j = o], so 
considering the expected number of *-last intersections gives 

oo oo 

i=0 j=0 
oo 

= Y J X k (k + mAX k = o]f(X) 

< ( ' E (* + Wo[X k = o]+ f; ^^AM /(A) 



□ 



For a path (X t )f =0 , a point X u is a local cutpoint if {X U _ T , . . . , X u ^\} n 
{X u+ i, . . . , X u+T } = 0. Lemma |4.1l and condition © imply that the ex- 
pected number of local cutpoints of random walk is a positive propor- 
tion of the length of the path: taking 1 — A = (T|G n | 1//2 ) _1//2 means that 
[(1 - A) 2 |G n |] _1 tends to 0, and also that P[min{T x ,Ty} > r] = 1 - o(l). 
Lemma 14.11 then implies that the probability that a given point is a local 
cutpoint is at least 1/6 + o(l). The significance of local cutpoints is that 
conditioned on not having long loops (meaning loops of length greater than 
t), local cutpoints are also retained in the loop-erasure. 
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Corollary 4.1. Suppose that {G n } is a sequence of graphs satisfying the 
assumptions of Theorem and let {Xt} be simple random walk on G n 
with geometric killing rate {L\G n \ l l 2 )~ l . Let T be the killing time, and let 
T n be the event that for all t £ [0, T — s], the interval [t,t + s] contains 
a local cutpoint. Then there is a constant C such that for any s, P[r n ] > 
1 - L\G n \ l / 2 exp(-Cs/T). 

Note that taking s > (log|G n |) 2 r, as the choice in Table ^ does, gives 
P[T„] > \-oi\Gn\~ 1 ). 

Proof. By Lemma l4~Tl any point is a local cutpoint with probability bounded 
away from 0. Whether or not Xt and Xt+2r are local cutpoints are indepen- 
dent events, so the probability of having no local cutpoint in an interval of 
length 2kr decays exponentially in k. The result then follows from summing 
over all start times t G [0,T — s]. □ 

Repeating the argument that the the probability of a point being a local 
cutpoint is bounded below, but considering times that are locally retained 
rather than local cutpoints, shows that the probability that any given time 
is locally retained is at least (1 + oil))/ 9. In particular, this gives 

Corollary 4.2. Suppose that {G n } is a sequence of vertex transitive graphs 
satisfying J^j with constant 9, and suppose that s = s(n) and q = q(n) are 
such that q < |G n | 1 / 2 ~ <5 / 4 and s = o{q). If {Xt} is simple random walk on 
G n , then 

(19) E \\LE s (Xt)Us\ ] - q + ° {,,) 



> 



In addition to knowing that a positive fraction of the walk is retained by 
local loop-erasure, we also want to know that the probability of two random 
walks intersecting is not too greatly reduced by local loop-erasure of one of 
the paths. For two i.i.d., transient random walks {Xt} and {^}, this holds 
quite generally. For example, 

(20) P[LE(X t ) n {Y u } ± 0] > 2- 8 P[(X t > n {Y u } + 0] 

(see |14j). Markov chains with geometric killing on a finite state space are 
transient Markov chains, but 1)20(1 does not apply in our case, partly because 
we need deterministic (rather than geometric) killing, but mostly because 
we are interested in cases when the killing times are on different orders of 
magnitude and so the killed walks are not i.i.d. 

Lemma 4.2. Suppose that {G n } is a sequence of vertex transitive graphs 
satisfying with constant 9. Let {Xt} and {Y u } be two independent ran- 
dom walks run with deterministic killing times Tx and Ty respectively, run 
from uniformly chosen starting locations x,y £ G n . IfTx,Ty < \G n \ 1 ^ 2 /2 
and s < e\G n \ l l 2 , then 

1 



(21) p[L E>( x t )S„n { rja ^ q > |G J^;_ 3) 



l + 2e 



2(1 - V~e) 
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where 9 is the constant in J^j. 

Proof. The proof relies on the second moment bound 

for non-negative random variables Z. For i < Tx and j < Ty, let Jij be an 
indicator random variable for the event 

{Xi = Yj G LE s (X t )£*}, 

and take Z = £V j Jij to be the number of ordered pairs (i, j) corresponding 
to intersections of LE S (X^) with {£n}. 

To bound EZ, let T\ and T<i be independent geometric random variables 
with mean (1 — A) -1 = (e|G n |) 1//2 , and let U denote the set of times < t < 
Tx that are locally retained. As {Xt} is symmetric, we can extend it to a 
doubly infinite chain (X t }'^ = _ 00 . Since Yq is uniformly distributed, 

> ^ (p[<^t>j = _3? n ( x *)£i = 0] - 2p [^ < S J 

Using Lemma 14.11 and observing that 9 > 1 yields 



EZ>^ 
" 9\G n \ 



' 2(Vi) 



_l + 2e 

To bound the second moment, note that the number of intersections of 
the local loop-erasure starting at x with the path from y is bounded above 
by the number of intersections of the original walk, so it suffices to bound 
the second moment of the number of such intersections. To do so, let 7y be 
an indicator random variable for the event {Xi = Yj} and fix a basepoint 
o £ G n . Because the starting points of our walks are uniform, 

Tx T Y T x T Y T x T Y 

Ez2 z E E E E 4E/ ^ - 3 E E E/ « 

j=0 j=0 k=i £=j i=0 j=0 

T x T Y T x T Y 

= ££££ 

i=0 j=0 k=i l=j 



4F [X k _ i+e ^ = o] _ 3T X T Y 

G n G n 



4TVTV^J^ 3T X T Y 

1 n| k=o e=o 1 n| 
TV TV 

l tT n| 

where 9 is as in (J2J). Using these quantities to lower bound (EZ) 2 /E(Z 2 ) 

gives the desired result. □ 

Let Cap r (5) be as in Definition ^ Taking Tx = q — Ss and Ty = r gives: 
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Corollary 4.3. Suppose that {G n } is a sequence of graphs satisfying the 
assumptions of Theorem \1.2L If q,r < \G n \ 1 ^ 2 ~~ s ^ 4: , and s = o(q), then for 
simple random walk {X t } on G n , 



(22) 



E 



Cap r (LE s (X t )S s ; 



> 



gr(l + o(l)) 
\G n \e 2 (46-3)- 



5. Tight concentration of longer segments 

We now combine the results of the previous section with large deviation 
estimates to show that |LE s (Aj)| and Cap r [LE s (^4j)] are tightly concen- 
trated about their means. We will do this by viewing A, as a union of 
smaller pieces and using the following large deviation bound: 

Lemma 5.1 (Hoeffding). Suppose {Z{\ are a family of independent random 
variables such that < Z* < b. Then 



(23) 



,i=i 









> nt 


< exp 


N)1 



This lemma is from jS], Theorem 1, equation (2.3), and is proved using 
standard arguments. 

Our applications of this large deviation bound include showing that, with 
high probability, the capacity and length of LE s (j4j) are close to their means 
for all Ai, as well as the fact that for different i, the sequences LE s (Aj) are 
relatively far apart in the graph. Much of the difficulty in doing this for 
Cap r [LE s (Aj)] involves the possibility that a run of length r of a walk might 
hit more than one subsegment of LE s (Aj). These multiple intersections are 
important because they mean that capacity is subadditive. 

Definition 5. Let {Y u } be simple random walk on a graph G, and let T$ = 
min{u > : Y u £ S} denote the hitting time for S. For sets U and V , the 
closeness of U and V is given by 

Close(C/, V) = Fn[Tu < r,T v < r]. 

Note that Close(L r , V) will be small if U and V intersect in a single point 
yet are otherwise very far away. On the other hand, if U and V are disjoint, 
but V is a translation by a small fixed distance, then Close(£7, V) will be 
large. Closeness is primarily a measure of whether or not typical points of 
U are near V , and vice versa, and is maximized when the two sets coincide. 

In the case when V is a segment of a random walk, Close(?7, V) is a 
random variable whose mean is bounded by the following lemma: 

Lemma 5.2. Let {G n } be a sequence of graphs satisfying the assumptions 
of Theorem \l.S\ and let r = r{n) be a sequence of positive integers. Let {Xt} 
be a random walk on G n started with Xq = x, let T > be a random time 
that is independent of {Xt}, and let V = {X t }J =T . Then for any set U and 
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starting position x, 
(24) 



E Close(£7, V) < 4 



rC&p r [U]ET 
\G n \ 



Proof. Let {Y u } be a simple random walk on G n started in the stationary 
distribution tt. By the strong Markov property, for any fixed set W, 

Close(C7, W) < F W [T V < r]Py T[/ [T w <r\T v < r] 

+ ¥ n [T w < r]¥ YTw [Tu<r\T w <r]. 
To apply this to our random V, let E x denote expectation given Xq = x. 



E, 



\[T v <r]Fy [Tu<r \T V < 



Ex 



J2 F *l T V < r > Y Tv = y]Vy[Tu < r) 



^P y [Tu < r]E x [F n [T v < r,Y Tv = y]] 



oo r— 1 



< F y\ T u < r \Y.Y. E - F -[ y « = X t = y,t<T] 



t=T U=0 



r _ n 2rET 2rETCap r [C/l 
Likewise, E x . \Fy Tu [Ty < r \ Ty < r]J is bounded by 

r— 1 oo 

JX^tv = y | TV < r] ^^p[y u = x t ,t < t I y = y ,x = x ] 



<Y,K[Y Tu =y\T u <r] 



U=0 t=T 

2rET 2rET 



I Gn | | G n I 

where the second inequality uses the fact that as t > r, 
F[Y U = X t \Y = y,X Q = x] < 2/\G n \. 
As P T [T[/ < r] = Cap r [f7], this completes the proof. 

Let a = a(n) and 7 = 7(71,) be as in (fTTj) . implying that 



□ 



E|LE s {X t }SJ= 7 r, 



ECa Pr [LE s {Xj 



r— s ] 
t=2s\ 



ar 

\G n 



Lemma 5.3. Let {G n } be a sequence of graphs satisfying the assumptions 
of Theorem M. OA and let {X t } be simple random walk on G n . Suppose that 
r = r{n) and s = s(n) are sequences such that 

4 S 1 / 2 |G n | 1 / 4 log|G n |<r<|G n | 1 / 2 - 5 / 4 
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Taking Ai as in {1$ , then whenever \G n \ > 2 s , 



(25) 



(26) 



\LE s (Ai)\ -yr 



Ca Pr (LE s (^)) 



/S\l/6" 




/r\ V 6 " 


>*(;) 


< exp 











ar 



\Gn 



> 



|G n |V2 J 



< 9 



,7/4 



|Gn| 7 / 8 



Moreover, a and 7 satisfy the bounds 

l + o(l) 



(27) 



7> 



a > 



l + o(l) 



2 (40-3)' 

The situation that we are most interested in is when s and r are as in 
Tabled i n which case assumption shows that Lemma 15.31 applies. More 
generally, note that the assumptions on s and r imply that as \G n \ — > 00, 
s = o(r) and also s\G n \~ 1 / 2 = o(r 2 \G n \~ l ). 

Proof. Let q = [{rs) 1 ^]. To prove all three parts of this lemma, we will 
break the interval A{ down into \r/q\ smaller pieces Bi^ whose length is 
asymptotically q, and use Lemma 15.11 and the results of Section 0] Let 
Bi t k = Z n [ir + kq + 2s, ir + (k + l)g — s], and denote 

LE s (S i)fc ) = (X t ) t( zB itk r)U, 

where U is the set of times that are locally retained. 

We begin with (|25|). Breaking LE s (Aj) into pieces and summing gives 

[r/aj-l 

|LE s (A,fe)| < |LE S (A)| 



fc=0 



< J] \LE s (B i>k )\+3s(- + l\ . 
k=o ^ q ' 



By our choice of q, rs/q ~ r(s/r) 1 ^ 2 , which is of a lower order than rfs/r) 1 ' 6 . 
The restrictions on r and \G n \ insure that 3s(r + q)/q < r(sfr) 1 ^. The spac- 
ing between the B{^ is such that |LE s (.Bjfc)| are i.i.d, so applying Lemma 
l5~T1with Z k = |LE a (jB ifc )|, b = q, and t = q^r' 1 / 3 shows that 



'[r/qj-1 

(Zk-EZ^yris/r) 1 / 6 

k=0 



< exp 



< 



exp 




The argument for (|26|) is similar, but the naive upper and lower bounds are 
farther apart. The capacity is bounded above by the sum of the capacities 
of the pieces, plus a little extra since the B^ are spaced 3s steps apart in 
time. Summing the capacity of the pieces overcounts by the probability of 
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hitting at least two pieces. For a lower bound, we subtract the probability 
of having double hits. 

[r/aj-l 

Cap r [LE.(B i , J fe)] - Close(B itk ,B itj ) < Cap r [LE s (^)] 

k=0 k,j,k^j 

[r/qi 



< Ca Pr [LB s (B itk )] + 3 I - + 1 ) 
k=o \1 / 



sr 
\G n \ 



Applying Lemma 15.11 with Z k = Cap r [LE s (i?.j k )], b = qr/\G n \, and t 
gives 



9/4' 



< exp 



1 r 3/2 



2s|G n l 1 /4 



which is at most (l/2)|G n | 2 < r 7 / 4 |G n | 7 / 8 by the assumptions on r. By 
Lemma 15.21 



E Closed, < 4 
By Markov's inequality, 



2 22 
r \ o r 

+ 1* ' 



]T Closed, Bij) > 
This bound proves l|26|). 



,9/4 



2|G n |9/s 



Gr, 



< y (r 7 / 4 |G n |" 7 / 8 



Finally, the lower bounds of (J2ZJ) are those of (JTHI) and 

6. Completion of the coupling argument 



□ 



The aim of this section is to prove Lemma 12.21 First, we will show that 

h'E(Xt)J =0 can be decomposed into runs of length r without losing too much 

information, then we will prove two lemmas that allow us to couple random 

walk on G* T with K* ~ for suitable m and L, and then we will end by 

m < L 

proving Lemma l2.2l 

Given T, r and s, a time index i < i = \T ' jr\ is called good by time T if 

{Xt}teAi n [{X t } t < ir U {X t } te [(j +1 ) nT ]] = 0, 

where is as in (|14|) , Intuitively, this means that there are no loops longer 
than length s with one endpoint inside A{ and one outside, although there 
are some slight differences at times near the endpoints of Ai. Similarly, a 
time index i < I is called a single intersection at time T if there exists j < £ 
such that {X t } teAi n {X i } teAj . / 0, but {X t }< eAi n {X*} te/ = 0, where 
I = {[0, T] \ ([(i - l)r, ir] U [(j - l)r, jr})} n Z. 

Finally, a time index is called 6ad if it is neither a single intersection nor 
good. Let Bt denote the collection of time indices i < £ that are bad at 
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time T, let Ct be those that are single intersections, and Qt be those that 
are good. 

Definition 6. Given T,r and s, let I = \T/r\. A run (X t )J =0 is called 
locally decomposable if: 

(1) B T = 



(2) for all i < i, 

(3) for all i < i, 



|LE s (^)|-7r <2r(f) 1/6 

Ca Pr [LE s (A)] - ar 2 \G n \~ l \ < r 9 / 4 |G n |" 9 / 8 

(4) There is no pair (t,u) £ 1? with t,u £ [0, T] and \t — u\ £ [r, r] such 
that Xt = X u . 

(5) There is no pair (t,u) £ Z 2 with t,u £ [0,T] and \t — u\ > s such 
that t £ UiAi and X t = X u . 

(6) For all t £ [0, T — s], the interval [t,t + s] contains a local cutpoint. 

Lemma 6.1. Let {X{\ be simple random walk on G n and T a geometric 
random variable with mean LIG^I 1 / 2 that is independent of {Xt}. Sup- 
pose that r and s satisfy the hypotheses of Lemma 15, M and also that s > 
r(log |G n |) 2 . Then the probability that (Xt)f =0 is locally decomposable is 
1 - C»(L 2 r 3 / 4 |G n |- 3 / 8 ). Moreover, 

(28) E[B T UC T } = 0(L 2 ). 

When s,q, and r are as in Tabled condition © implies that Lemma 16.11 
applies and also that r 3 / 4 1 C„ | ~ 3 / 8 = o(|G„|- 3<5 / 16 ). 

Proof. Counting the expected number of long loops involving LE s (Aj) gives 

27V 

F[i£B T UC T | T] < ——. 

\G n \ 

As I < (T/r) + 1, this implies that 

2(T 2 + rT) 



(29) E[\Bt U Ct\ I T] < 



\G n \ 



Using the restrictions on r and taking the expected value of (|29|) yields (|28[1. 

Likewise, the probability that an index i is bad is bounded by the expected 
number of pairs j and k such that {Xf}teAi intersects both {-^t}fj-iy<i<j> 
and {Xt}( k _^ <t<k , plus the expected number of times such that {X t }teAi 
intersects the gaps between the Aj, yielding 



Qsr 
\G n \ 



2(T + r) 2 r 2 6(T + r)s 
— \n 12 ' \n l ' 



and as ET = L\G n \ l l 2 and s\G n \- 1 ' 2 = o^G^- 1 ), we see that E|0 T | 
O (^rlGnl" 1 / 2 ) . In particular, F[B T = 0] = 1 - 0(L 3 r|G n |" 1 / 2 ). 
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j < i 



-i 



< r 9 / 4 |G n |" 9 / 8 , j<i 



|LE a (Aj)|-7r 
Cap r (LE a (il i ))-cw 2 |G ! r. 
Using Lemma O and the facts that ^ = [T/r] and ET = LjCn] 1 / 2 , 



\At I T > 1 - 10- 



p[^] = l _ O L 



r |G n | 7 /8' 

r 3/4 \ 



G n | 3 / 8 



Let r n denote the event that for all t £ [0,T — s], the interval [t,t + s] 
contains a local cutpoint. By Corollary 14,11 there exists C so that P[T re ] = 
1-LlGnl 1 / 2 exp(-Cs/r), which is 1 - 0(Lr 3 / 4 |G„|~ 3 / 8 ) by the lower bound 
on s. 

To bound the probability of Condition 0] of Definition El holding, condi- 
tioned on T, the expected number of loops of length in the interval [r, r] 
is 

T i+r 



2Tr 
\G n \ 



i=0 j=i+r 

As ET = L\G n \ 1 / 2 , Condition H holds with probability 1 - 0{Lr/\G n \ x f 2 ). 

For Conditional conditioning on T and counting the expected number of 
pairs t and u that we wish to avoid bounds the probability of failing by 



3s 



+ 1 



IT 
\G n \ 



so Condition holds with probability at least 1 — 0(sL 2 /r). Combining all 
six of these bounds proves the claim. □ 

For a finite tree T, call a vertex v G T a leaf if the degree of u is 1. 

Lemma 6.2. Suppose T and T are trees with k leaves {x\, . . . , x^} and 
{yi, . . . ,yk} respectively. Let £ and r) be uniformly chosen from T and T 
respectively. There exist constants C\ , C 2 depending on k such that if for 
some e > and R > 1, \d(x,i,Xj) — Rd(yt,yj)\ < ed(xi,x 2 ) for all i,j < k, 
then there exists a coupling of £ and rj such that 



(30) P V» < k,\d(Z,Xi) -Rd(7], yi )\ < C x [ed{x x , x 2 ) + R] 



> 1 - C 2 e. 



Proof. In the case when k = 2, take \x to be a uniform random variable on 
[0, 1], take £ so that d(^, x\) = \ji\d{xx,X2)-\-^\\ , and take rj so that d(rj, yx) = 
l/j,[d(yi,y 2 ) + 1]J- We then have \Rd(rj,yi) - d(£,xi)\ < ed{x\,x 2 ) +R + 1. 
As R > 1, this shows that we can take C\ = 2 and C 2 = 0. 
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Proceeding by induction, assume that we know the result for k = n. Given 
a tree with n + 1 ends, the vertex x n +i is connected to the subtree spanned 
by {xi, . . . , x n } by a segment of length 



A 



mm [i 

i,j<n 



(xi,x n+ i) + d(xj,x n+ i) - d(xi,Xj)]. 



Likewise, let A denote the length of the segment connecting y n +\ to the tree 
spanned by {yi, . . . , y n }. By repeated application of the triangle inequality, 
our assumption on the distances shows that \A — RA\ < 6ed(xi, x 2 ) and also 
that 



F[d(£,x n+1 ) < A] - T[d(rj, y n +i) < A] 



< Ce 



for some constant C = C{n). If both £ and rj are chosen on the segment 
connecting x n+ \ and y n +i to the rest of the tree, then we can couple them 
so that \d(£,x n+ i) — Rd(rj, y n +i)\ < 2i? + 6ed(x\, x 2 ), and so by the triangle 
inequality, for all i < n, — Rd{rj,y,i)\ < 2R + 8ed(x\, £2). If both 

are in the rest of the tree, then the induction hypothesis shows that with 
probability 1 — C2(n)e, we can couple them so that \d(^,Xi) — Rd(rj,yi)\ < 
Ci(n)[ed(xi, X2) + R] for all i < n, and thus by the triangle inequality, 
\d(£,x n+ i) - Rd(r/,y n+ i)\ < (Ci(ra) + 6)ed(xi,x 2 ) + C\{n)R. We have thus 
proved the lemma with C\(n + 1) = Ci(n) + 6, Ci(l) = 2, and C2(n + 1) = 
C 2 (n) + C(n), C 2 (l) = 0. □ 

We now begin constructing the coupling. Let G* T and K* ~ be as in 

o r- o n,L m i 

Lemma 12.21 with m and L given by 

\G n \ 



(31) 



m 



and 



(32) 



L 



1 



1 



For fixed L, note that La 



L\G n \V 2 
L as n — > 00 since r = oQGnl 1 / 2 ). 



Lemma 6.3. Suppose that {G n } is a sequence of graphs satisfying the as- 
sumptions of Theorem \1.°A and r = r{n) and s = s{n) are sequences of 
constants satisfying the assumptions of Lemma \6.1\ For fixed L, let {X{\ be 
weighted random walk on G* L with Xq chosen uniformly from G n . Taking 

m and L as in and \3°J\) . let {Y u } be weighted random walk on K* ~ 

with Yq £ K m , and let Si and Si be as in hl'J\) and Mb}) . Suppose that 
Tfc C G* L and C K* ~ are such that p 6 7^ and p £ T^. Then there 
exists a coupling of {X{\ and {Y u } such that 
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Ca Pr [T fe nG n ] 



\% n K„ 



m 



where T is the hitting time of T k and T is the hitting time of . 

Note that for s, q and r as in Tabled when (j3J) and © hold, the hypothe- 
ses are met and also that the bound obtained is 1 — 0(\G n \~ s / 32 ). 

Proof. To establish the coupling, we will use the fact that while the distri- 
bution of Sj depends on more than just <Sj_i, the fact that Y u is uniformly 
chosen from K m means that the distribution of Sj depends only on the 
value of Sj-i. To this end, we will run the process {X t } on G* n L , use that 
to evaluate the indicator random variables {Iy}ieSj-i (and hence compute 
Sj), couple the indicators {Iij}ieSj-i with {L j j}j & s j _ 1 , and use the fact that 
we can construct Yj given the values of the indicators {iij} ie §. 1 - 

We need to show two things: first, that we can couple \T/r\ and T, and 
second, that we can couple the walks until these hitting times. 

To couple the hitting times, let T p denote the first time that Xt = p, and 
note that T p > T. Until time T p , the path of {Xt} is a simple random walk 
on G n that is independent of T p , so a first moment estimate gives 



F[T <£ uAi | T p ] < 



\T k \3s 
\G n \ 



T 



As ET p = LlGnl 1 / 2 , this yields F[T $ UAi] = O {Ls^r'^Gn^ 1 / 2 ) . 
Likewise, let T p denote the hitting time of p. By the definition of L, 



F[T = j\T>j- 1] 



\T k n K n 



1 



+ 



1 



771 



1 



+ 1 



1 



1 



L|G n |V2 



L\G n \V 2 

To estimate what happens on G* L , we will use the fact that s is much 
larger than r. In particular, as the L°° distance between the distribution 

of {X t } and the stationary distribution tt, given by sup x p t (x)\G n \~ 1 — 1 , 
is submultiplicative (see e.g., JSJ Proposition 2.2), we have = v\ = 

[1 + 0{2- s / T )]/\G n \, which is "[1 + oflGnl- 1 )]/^! by the lower bound on 
s. Using this to correct the fact that our walk is not quite uniform in 
distribution gives 



Ca Pr _ 3s (T fe n G n ) [1 + 0(2- s / T )} + 1 - 

>¥[{X t ) teA4 n % ± | T > (j - l)r - s 
Cap r _ 3s (T fc n G n ) 



> 1 



1 

L\Gn,\^/ 2 



3s 



1 + 0(2- s / T ) 



1 



+ 1 



L\G n \V 2 



-3s 



1 



r— 3s 
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Combining the estimates on G* T and K* ~, we obtain 

n ' n m,L 



V[(X t ) teA . n % ^ | T > (j - l)r - s] - P[T = j \ T > j - 1] 



< 0(2^) + O ( * /2 ) + |Ca Pr _ 3s (T fc n G n 



IT, n 



m 



Note that Cap r [7fc] — Cap r _ 3s [7fc] = 0(s|7fc| |G n | _1 ) by considering the 
expected number of intersections in the interval [r — 3s, r]. Using the lower 
bound on s to bound 2~ S / T and the fact that T is bounded by a geometric 
random variable with mean L\G n \ 1 / 2 , we have thus shown that the hitting 
times can be coupled with the claimed probability. 

We now show that, with high probability, that the two walks are coupled 
up until time T. Consider the event that the first step at which the coupling 
breaks is step j, which in turn is bounded by 

(33) P[5,-_i =S j - 1 ,S j ^Sj]. 

To bound ()33j) . we will bound the total variation distance between the 
joint distributions of {Iy}ies 3 -_i an d {lijljgs. 1 conditioned on the event 

Sj-\ = Sj-i as well as the past of the walk {^Q}t< r (j-i)- To do so, consider 
three types of cases for the values of {Iy}ieSj_i • 

Let Case 1 be the case when no long loop is formed at step j, meaning 
that we need to bound 



(34) 



[lij = OVi G £,■_!] 



= OVi G 



Conditioned on Sj-% = 



[I 



0\/i G 5 



15, 



'3-1 1 

m 



Again using a factor of 1 + 0(2 S I T ) = 1 + o(|G n | 1 ) to correct for the fact 
that our walk is not quite uniform in distribution, 



1- Y, Ca Pr _ 3 jLE s (^)] < Ppy = OV* G Sj-xWl + o{\G n \~ 1 )} 



K ieSj- 



i-l 



<1- i Yl Cap r _ 3s [LE s (A,)] + Yl Clo B e(LE.(i4 j ),LE.(A / )). 

Combining these and using the triangle inequality, we see that is, up 

to a factor of 1 + o(|G n | -1 ), bounded by 

(35) 

J2 Close(LE s (A i ),LE s (A j ))+ £ Gap P _ 3 .[LE a (i4 j )] - -j- 
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Let Case 2 be the case when there is a long loop formed involving exactly 
one Ai, meaning that we need to bound 

(36) |p[Iy = 1, l kj = k G S^ x \ {i}] - P[Iy = 1, l kj = k e \ {i}] . 
But 

Cap r _ 3s [LE s (A;)] > Ppii = = OA; e <Vi \ {»}](1 +o(|G n |- 1 )) 

>Ca Pr ._ 3s [LE s (A;)]- ]T Close(LE s (A;),LE s (,4 fc )) 

fees 3 _i\{i} 

and P[Ijj = = OA; G <5j-i \ {i}] = — , so (|3*6*)l is, up to a factor of 

1 + o(|G n | _1 ), bounded by 



(37) 



Ca Pr _ 3s [LE s (A;)] - - 



+ Close(LE s (Ai),LE s (A fc )). 

fce5j_i\{i} 

Let Case 3 be the case that two or more of the l are 1. The 

probability of this is on K m and on G n is at most 

(38) Close(LE s (A),LE s (40). 

i,kGSj — i,i^k 

Summing (|37j) over all i in and adding to (|35l) and (|38|) gives an up- 
per bound for the total variation distance between the joint distributions 
of {LjlteSj-i and i conditioned on = Sj-i and the path 

{Xt} t <{j~i)r of, U P to a factor of 1 + o(l), 



Cap r „ 3s [LE s (A;)] - - 
m 



3 Yl Close(LE a (A),LE a (A fc ))+2 ^ 

Taking expectation to remove the conditioning, Cap r _ 3s [LE s (Aj)] < r 2 /|G n | 
by equation ©. Using Lemmal5~21 E Close CLE „ f A- V LE S (A%)) < 4r 4 /|G n | 2 . 
Since T p is independent of the path of {Xt} for t < T p , 

E E E Close(LE 8 (A),LE 8 (A))< 1 ^ J 

j=l i,k£Sj-i,i^£k 

o- rL " 



As 



\G n \^ 

a(r — 3s) 2 |G n | _1 — m _1 = 0(r 4 \G n \~ 2 ), Lemma [5.31 and the bound 



from (El) show that 



E 



Cap r _ 3fl [LE s (Ai)] 



m 



< 



IGJV2 



9/4 



o 



IGJV2 



+ 9- 



9/4 



r 2 r 7/4 
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This leads to the bound 

\t p M 

3=1 ieSj-i 



Cap r _ 3s [LE s (^)] " - 



< 



ET 2 



-O 



r \ 9/4 
\G n \V*) 



O 



lV /4 



which completes the proof. □ 
We now use Lemmas Ifi. 1 1 IfTHl to prove the induction step of Lemma l2.2l 



Proof of Lemma \2. 2l We proceed by induction. Let {X t } be a weighted 
random walk on G* L with X$ = Xk+i, and let T denote the hitting time of 
7^. Take q,r and s as in Tabled and assume that Lemma 12.21 holds for k. 
As before, take t = \T/r\ . If i £ £?t H and the path (Xt}J =0 is locally 
decomposable, then LE^Aj) C LE(X t )^ =0 and, in particular, 

(39) |LE S (A)| < |LE(X t )f =0 |. 

Moreover, when (Xt}J =0 is locally decomposable, the only difference between 
Ui e s £ LE s (Aj) and LE(X t )^ =0 are from the gaps of length 3s between the 



final run h'E s (Xt)J_^_ 1 y. Thus, when (Xt)J =0 is locally decomposable and 



A4, the runs that were erased during the single intersections Ct, and the 

LE,(X 
S £ = S^x * {£}, 

(40) |LE( 



(X t )f =0 \ < £ |LE.(^)| + (^-) + r(\C T \ + 1). 



Combining (JHHJ and (gOJ), 

LE(X t )^ =0 |- 7 r|^|| < f— ) +r(|C r | + l) + ^||LE s (A 

By the definitions of Table Q equation (|2"5|) , and condition © 
~3sT 



)l -TT 



E 



E[r\C T \] = 0(L 2 r) = 

When (Xt)f =0 is locally decomposable, 

T + r 



o(|G n | 1/2 " 5/2 ) 
(|G n | 1/2 -* /4 ) . 



and 



^||LE S (A 



)| - 7r 



< 2 



S\l/6 

r ( - 

r ' 



o(T\G n \- s / 12 



As T is geometric with mean L|G n | 1/2 , for large enough n, this is with 
probability 1 - o(|G n | _1 ) less than 1/2-5/24 . Note that 

F[S £ = S f ^i * {£} | T) > Tr|G„| -1 , 
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and applying this along with Lemma Idll and Markov's inequality gives 



(41) 



\LE(X t )f =0 \ - -yr\S t 



P\G n \V* 

To compare with the complete graph, 



|LE(X, 



t/t=o\ 



0\G n \i/* 



< 



\LB(X t )f =0 \- 7 r\S e 



+ 15/ 



TO: 



1/2 



|G n |V2 



/3|G n |V2 

But [m -1 / 2 - ra 1 / 2 |G n | -1 / 2 ] = 0(r|G n | -1 ), and by Lemma 10} with prob- 
ability 1 — 0(|G„| -<5 / 32 ), \St\ = \Se\ is the length of the segment added to 
7^ to form Tk+i- Let £ be chosen uniformly from Tj. and according to the 
coupling of Then 

a; fc+ i) = d(xi,i) + |LE(X t )^ =0 | + X T ) - dfo, £)■ 
Using the decomposition 



d(xi,x fc+ i) d(yi,y k+1 ) 



P\G n \V* 



m 



< 



+ 



P\G n \V 2 v'"' 
d(yi,Y T ) 



+ 



d(xj,X T ) - d(xj,£) 
P\G n \V 2 



0\G n \V 2 

we have already controlled the first term, equation (jllj) implies that 

P[|d(xi,X r ) -d(xi,e)| > |G n | 1 /2-*/32] < 0( | Gn |-5/32 )5 

and applying Lemma E21 to couple £ and Yt, we see that there is a coupling 
such that (JUJIl holds with probability 1 - GOG^-" 5 / 32 ). 



To prove ©, we first bound Close(7fc, LE(X, 



t/t=o) 



By Lemma 15. 21 



(42) E[Close(T fc> (X m _ {) i 

Markov's inequality gives 



%] <4 



Lr Cap r Tfc 
IGJV2 



Closed, <X t )? 



Cap^rV 2 
> 4 IGJi/4 1 Tfc 



< 



Lr 1 ' 2 



The fact that LE(Xt)^_ C {X t }j?l and monotonicity of closeness, along 



with the fact that r 1 / 2 \G n \~ l/i = o(\G n \~ d/a ), implies that 

P[Close(T fc! LE(Xt)f =0 ) < \G n \~ s / 8 Cap r (T fc ) \T k ] = l- o(\G n \- s / 8 ). 

Now estimating Ca Pr [LE(Xt)j£L ], wnen (Xt)f=o is locally decomposable, 
£ Ca Pr [LE s (A)]- ^ Close(LE s (^),LE s (^)) 

< Ca Pr [LE(Xt)f =0 ] 

3sT\ r 2 (|C T | + l) 



< ^[Cap r LE s (^,)] + 



|G n | 



2S 
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In particular, we have 



< 



E 



Cap r [LE.(A)] - - 
m 



+ Close(LE s (A),LE s (^)) + 



3sT \ t r 2 (\C T \ + 1) 
\G n \ 



\G n \ 



The expected value of the sum of the last three of these terms is bounded 
by 



Ar 2 E(Tf 



+ 



3sL 2r 2 [E(T) 2 + l] 



+ 







\G r . 



\G 12 1 \rt 1 1/2 1 \n 12 

l^nl l^nl ' l^ral 

By Markov's inequality, the sum of these three terms is therefore less than 
\G n \- s / m with probability 1 - o{\G n \~ 5 ^ 2 ). On the event At, 



E 



Cap r [LE a (A)] 



1 

m 



< 



T + r 



,9/4 



|G n |9/8 



|G n | 



which is less than |G n |"' 5 / 20 (r/|G n | 1 / 2 ) with probability 1 - odG^I -1 ) be- 
cause T is geometric with mean L\G n \ l l 2 . By Lemma 16.11 the values of 
Tableland Condition (j3J), LE(X()^ =0 is locally decomposable with prob- 
ability 1 - o(|G n |- 3<5 / 16 ). By Lemma ESI and our inductive assumption 
that © holds with probability 1 — 0(\G n \~ 5 ^ 32 ) on with probability 
1-0(|G„|- 5 / 32 ), 



Cap, r (T fe+ i)- Cap f ,(T fc 



a 1 / 2 r 



< ( Cap r [LE(X i )/ =0 ] + Close(T fc , LE(X, 

ISfWGnl 1 / 2 



t/I=o) 



\G l 1/2 

V2 r 



O 



-0(\Gn\- S/24 ) 

-o(\G n \- S/24 ) 



+ 



+ 



ma l / 2 r 
Tk+i\ — \Tk\ 



and hence by induction Q holds with probability 1 — 0(\G n \~ s ^ 32 ). 

Finally, we need to show that the uniform measure \i and the hitting 
measure v on Tfc+i can be coupled in such a way that (JTT|) holds. Let £ be 
chosen according to ji and 77 according to v. Note that 



(43) 



We will use l)43JI as an upper bound whenever our procedure for constructing 
the coupling fails. To make this coupling, consider the event F that either £ 
and 7) are both in 7^ or both are in LE S (/LJ for some i £ Sg. If both are in 
7^ , we wish to use the induction hypothesis to establish the coupling. For a 
random walk (Y u ) started in the uniform distribution on G n , let T id denote 
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the hitting time of T k and T new the hitting time of LE(X;)^Lq. We know 
that with probability 1 - 0(\G n \- 5 / 32 ), that X Told can be coupled with a 
uniformly chosen point. But 



and so 



< r o i d < T new + r] < 



[X Tn] 



Close(T fc ,LE(X, 



t/t=o) 



0(r\G n \-^) 



C&p r [T k+1 ] 
r new < Tow] I < 0(2- r / T ) +0{r\G n \- 1 ' 2 ). 



x&T k 

This means that conditioning on hitting T k first has a small enough effect 
that we can still couple £ and rj with the claimed probability. If both £ and 
77 are in LE S (^4;) for i G </ T , then dr k+1 {C,v) < r = o(\G n \ 1 / 2 ). We thus 
need to show that P[F C ] = 0(|G n |- 5 / 32 ). 

To do so, by Lemma IrTTl (X t )J =0 is locally decomposable with probability 
1 - o(|G n |- 35 / 16 ). On the event At, 

„2 



Cap r [LE s (A 4 )] 
|LE S (,4 4 )I 



ar 



\Gn 



\l + 0(\G n \- 5 ' 1& )] and 



7r 



[l + 0(|G n |- 5 / 12 )] 



Ca Pr [LE s (^)] _ ra 



l + 0(\G n \- s ' m 



and so 

(44) |LE S (^)| 7 |G, 

Likewise, we have already seen that, with probability 1 — 0(|G n |~ 5 / 32 ), equa- 
tions © and (fTU|) hold on T k+ i, so 

Ca Pr (T fc+1 ) = \f k+1 \m[l + 0(|G n |- 5 / 32 )] and 

|T fc+1 | = |f fc+1 |(|G n |/m) 1 / 2 [l + 0(|G n |-'/ 32 )] J 

which gives 



Cap r [T fc+ i] 



ra 



l + O \G n \- 5 / 3 ' 



\T k+ i\ -y\G ri> 

By considering the total variation distance between [i and u, there is a 
coupling such that 



P[F C I T fc , (X, 



t/t=o. 



< 



Y, \fJ.\LE a (Ai)] - v\LE.(Ai)] 

r\C T \ 



ies e ng T 
+ \ f i(T k )-u(T k )\ + Q 



k+l\ 



+ 



sT 



r\Ti 



fc+H 



To use F[F C ] = EF[F C \ T k , (X t )f =0 \, note that the expected value of these 
last two terms is 0(\G n \~ s ^). For the other two terms, for any S, 



Hs)-v(s)\< 



fJL(S) 



Cap r (5) 



Cap r (T fc+ i} 



u(S) 



Cap r (5) 



Cap r (T fc+ i) 
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On the event At, (03) and ^ imply that for 5 = LE s (Aj) or % 




Moreover, 



Cap r (T fc+ i) ~ Cap r [T fc+ i] 



Cap r (5) Close(5,7fe + i \ S) 



By Lemma 15.21 



E[Close(LE a (Ai),r fc+1 \ LB s (A l )) \ T k+1 \ LE S (^)] 



.2 




n 



Combining (|46[1 with (|42j) . we thus obtain 




which is o (|G n | 5 / 32 ) as required. 



□ 



7. Stochastic domination of spanning forests by trees 

We now prove Lemma l'2.11 The heart of the proof relies on the stochastic 
domination of T by T*. Suppose that G is a graph with vertex set V and 
edges E. Let G\ denote the graph that is the extension of G formed by 
adding an additional vertex p, and from every vertex v E V, an edge (v, p) 
of weight 1 — A. Let T\ be a weighted spanning tree on G\ as generated 
by Wilson's algorithm with root vertex p. The graph T\ induces a forest 
T C G simply by restricting to edges in E f]T\. We will show that T is 
stochastically dominated by the uniform spanning tree T. 

An event A is said to be an increasing event on a graph G if for any 
subgraph a E A, if a' is another subgraph of G formed by adding edges to 
a, then a' £ A as well. We say that an event A is supported on a set of 
edges Ei if determining whether or not a is in A only requires looking at 
the edges E\. (Equivalently, if a € A, then a' G A, where a' is the subgraph 
whose edges are in both a and Ei). 

Lemma 7.1 (Feder and Mihail). For increasing events A and B supported 
on disjoint edge sets of G, P[T G A \ T E B] < P[T G A]. 

Let Ei and Ei be disjoint edge sets such that A is supported on Ei, B is 
supported on E2, and Ei U E2 = E. The case when \Ei\ = 1 was originally 
proved by Feder and Mihail (0, Lemma 3.2), and they remark that iterating 
their proof implies that the general case is also true. A proof of the general 
case appears in the solution to Exercise 8.10 in |13j . 

Fix integers Mij > 1 for i,j < k. Let B denote the event that the degree 
of p is at least 2, and let A denote the event that there is a path of length 
exactly Mij in G from x% to Xj for all pairs i,j < k. Clearly the event A 
is increasing and requiring that the path be in G means that the event is 
supported on E, the original edge set of G. On the other hand, the event B 
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is supported on edges from G to p. As adding edges from p to G increases 
the degree of p, the event B is increasing. There are no loops in a forest, so 
the event A implies that d^(xi,Xj) = M^j for all 

By Lemma l7.1[ the events A and B are negatively correlated. Moreover, 
B c is equivalent to T being a spanning tree. In particular, conditioned on 
B c , T is equal to T in distribution. This shows that the probability of 
having the right lengths in T is a lower bound for the probability of having 
the right lengths in T, i.e., for any collection of values Mi a < oo, 



(47) F[df(xi,Xj) = MijViJ < k] < W[d T (xi,Xj) = M i%j \/i,j < k]. 



Using the convention that d^(x{,Xj) = oo whenever x\ and xj are in different 

components of T, the total variation distance between the joint distribution 
of d^-(xi,Xj) and the joint distribution of dr{xi,Xj) is bounded by 



The key step is now controlling the probability that {x±, . . . , x^} are in 
the same component of T. 

Lemma 7.2. Suppose that {G n } is a sequence of graphs satisfying the as- 
sumptions of Theorem \1. 6 A For any e S (0,1), let {Xt} and {Y u } be two 
independent random walks on G n , with (possibly different) starting points x 
and y, and let Tx and Ty be independent geometric random variables with 
mean e~ 2 1 G n \ 1 / 2 . Then 



Proof. Let s and r be as in Tabled For any set S C G n , let Tg denote the 
time {Y u }™ =0 first hits S. If Cap r (5) > re/\G Tl \ 1 / 2 , then 



F[T Y <T S ]<Y^ P[*r < T Y < (i + l)r | T s , T Y > ir]F[T s , T Y > ir] 



P[3 : df(xi,Xj) = oo]. 



(48) P[LE(X t )^ n {Y u }iL ^ 0] > 1 - 2s - a(n, e), 

where a(n,e) — > b(e) as n — > oo and b(e)/e — ► as e — > 0. 



j=0 



oo 




P[T 5 > ir]F[T Y > ir] 



i=0 




e 2 r 2|G n |V 2 
G n |V2 re 



= 2e. 



The proof thus reduces to finding a(n, e) such that 



P Ca Pr [LE(A^ ] > 



re 



> 1 — a(n, e). 
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Let T = (2e/a)\G n \ l l 2 , where a is as in (|17|). Since Tx is geometric with 
mean e~ 2 |G n | 1//2 , we have P[Tx < T] < 2e 3 /a. Restricting to the event 
{Tx > T}, we will now view the path (Xt)J* as being in two parts: an 
initial run of length Tx — T and a final tail of length T. The idea of the 
proof is that if the initial segment has a large capacity, then because the tail 
is short, it probably misses enough of the initial segment that the capacity 
remains high. Conversely, if the initial segment has low capacity, then the 
tail will probably survive and is long enough to be of high capacity itself. 

More formally, consider two possibilities based on whether or not the 
event 

(49) {Cap r [LE(X t )^ - T ] > 2er/\G n \ 1 / 2 } 

occurs. Let (r)[k)) be an increasing sequence such that LE(Xt) t j5 = 
{Xrjffy}. When (fl9|) holds, let M be the smallest number such that 

Cap r [(X v ( k -)) k <M] > , G , 1/2 - 

Denote this initial segment by U := (X v ^)k<M- Because capacity is subad- 
ditive, M > elGnl 1 / 2 and 

er _ TT er r 2er 

< Cap r U < — l1/9 + — — < 



|G n |V2 ^ r |G n |V2 \ Gn \ |G n |V2- 

Subadditivity of capacity also implies that Tx—T—M > e|G n | 1//2 — 1, which 
for large enough n is greater than s. But 

P[{^t} t r ir" T -T n u + 0] < e TOfffJ+T n {x t f x ~ T - £lGnll/2 



< 



L X -J Jt=0 

2sET x 2s 



Subdividing the final T — s steps of the walk into pieces of length r, using 
the fact that Xt x ~t+s is close to uniform, and considering the expected 
number of those pieces that intersect U gives 

P[W}^„ T+ ,n^e]<^f 

If _ T n C7 = 0, then C7 survives loop-erasure, so this shows that, 

conditioned on (|49|) . with probability 1 — a(n,e) for a(n, e) of the suitable 
form, ?7 survives loop-erasure and yields the desired capacity for LE(X 4 ). 

When (gnj) fails, we have Cap r [LE(X t ) t<Tx _ T } < 2er/\G n \ 1 / 2 . The prob- 
ability of a segment of length T started at uniform intersecting this initial 
piece is then 0(e 2 ) by the same argument as before. Adding a buffer of s 
steps to get close to a uniform position, we thus have 



P[{X t } t>Tx . T+s n LE(X t )Jx- T = 0] = 1 - 0(e 2 ) - (1/2 
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Moreover, T is small enough such that the expected number of loops longer 
than r within the final T — r steps is bounded by 4e 2 /a 2 . Thus with prob- 
ability 1 - f(n,e), where f(n,e) = 0(e 2 ) + (1/2)L S / T J, LB(X t )J^ Q contains 
W = LE s (X t )J* Tx _ T+s 



Ca Pr (LE(X, 



th=o) 



and so 

re 

> 



\G I 1 / 2 



> 



Cap r (W) > 
Cap r (W) > 



re 



\G I 1 / 2 
a Tr 



f{n,e) 



f(n,e). 



2 \G n \ 

By again breaking the final T — s steps up into runs of length r, using 
the concentration of capacity about its mean on such a run, and bounding 
the closeness between these runs as in the proof of Lemma 12.21 this final 
probability is of the form a(n,e) as required, with 6(e) = 0(e 2 ). □ 

Proof of Lemma \2. 11 As mentioned above, we need to bound 
(50) F[3(i,j) :d f (x l ,x J ) = ooi 

and in particular show that it is less than e for sufficiently large n and L. 
But using Lemma 17.21 gives 

k 2 

P[3 : df(xi,Xj) = oo] < —F[df(x 1 ,x 2 ) = oo] 

2 



< 



2 

k 2 



+ a{n,L- 1 / 2 ), 



2 LV2 

where a(n,x) is as in Lemma 17.21 Taking L = k 4 e~ 2 bounds (|B0jl by e + 
a(n, ek~ 2 ), which for small enough e and large enough n yields the required 
bound. □ 



8. Constants on the torus 

This section is devoted to proving Theorem ll.il To see that satisfies 
the hypotheses of Theorem 11.21 note that for simple random walk on 
with holding probability 1/2, \F [X t = o] - n~ d \ < Cr d/2 for a suitable 
constant C (see e.g., jl], Chapter 5). This means that the only way in which 
Theorem 11.11 is not a special case of Theorem 11.21 is that there is a single 
rescaling constant (3 rather than a sequence of constants (3 n that (possibly) 
depend on n. Thus, what we need to show in this section is that lim/3 n 
exists. We will do so by giving an expression for the limit. 

Lemma 8.1. Let {Y{\, {Z u }, and {W v } be independent simple random 
walks on 7L d , all starting at the origin. Let G n = and take a = a(n) and 
7 = 7(71) to be as in jl7j) . Then for d > 5, 

(51) lim 7 (n) = F\LE(Y t )£ n {Z u }™ =1 = 0] 



(52) lim a(n) = 

n— >oo 
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LE(y t )g° n {Z u }™ = 0, (LB(Y t )™ U LE(Z U )J° J n {W v }\ 

Proof. Take s,q, and r as in Tabled To understand scales, recall that on 
Z^, r is on the order of n 2 , meaning that s,q and r are on the order of 
n (d+12)/8 5 n (rf+4)/4 j and n (3d+4)/8 respec tively. 

Let f/ C [2s + 1, r — s] be the set of times t in the interval [2s + 1, r — s] 
that are locally retained (see Definition |3J) . As the probability of each time 
being locally retained is constant on [2s + 1, r — s], 

7 r = E|LE s (^-| s+1 | 

r—s 

= E p t teC/ ] 

t=2s+l 

= (r - 3s)F[q <G U]. 

As s = o(r), proving (|51j) reduces to computing ]im n ^ 00 W\q G £/]. On the 
torus in dimension d > 5, the expected number of loops of length greater 
than n 7 / 4 in a run of length r is bounded by 

r r r 

E E p[^ = ^-]< E ^o[^ = °] 

j=0 j =i+n 7/4 fc=n 7 /4 



For d > 5, this expression is o(l). Moreover, Lemma 14,11 and the proof of 
Corollary 14. II imply that 

F[3T G [n 7 / 4 ,n 9 / 5 ] : <^>f =T _ n7/4 n (X t )J%£ = 0] = 1 - o(l). 

Combining these two facts, the probability that (Xt)f— has a cutpoint in 
the time interval n 7 / 4 < t < n 9 / 5 is 1 — o(l). The importance of having 
cutpoints is that whether or not the point Xj survives loop-erasure can be 
determined from only considering what happens between two cutpoints, one 
at a time before j, and one at a time after j. As 9/5 < 2, with probability 
1 — o(l), any run of length n 9 / 5 remains inside a cube of edge length rt, 
and in particular does not see the difference between the torus and the full 
lattice Z d . Combining these facts, 



[qeU} = ¥[LE(X t )L-s n {x t }l 



l 



= P[LE(X,) 9 = _ n9/5 n {X t }ti = 0] + 
(53) = nmYt)to n {Z u }Cl = 0] + o(l). 

Note that if (LE(y t )£o n {Z u }™ = i) + (LE^g, 5 n {ZuY^X) , then ei- 
ther {Y t } t>n9 /5 n {Z u } u>0 / or {yt}t> n {Z u } u>n g/ 5 / 0. But a first 
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,9/5 



l-ol) 



moment argument shows that 

{z^Z^w n {^}*=i) = ({Zu}™ =1 n {? t }« 

which completes the proof of (|51|) . 

To prove let S = LE s (X t )^ 2 s s+1 . As a 

need to compute ECap r S. For a simple random walk {^fcl^Lo on let 
= inf{£; > : 6 S}, and Tg = inf{A; > 1 : lj 6 S}. Considering the 
time reversal and again letting U C [2s + 1, r — s] denote the locally retained 
times, 



r- 2 ECap r S|G n |, we 



C aPr S=^2^2F n [T S = k,X j 
jeUk=o 



EEE 



,\t s = k,Xj = Y k ]n 



k=0j£U z&Zi 



z n 



fc=o jet/ 

Let be an indicator function for U and let W denote the event 

{LE(X t )° = _ s n{X t }f =1 =0}. Then 



ECap r S = ^ ^ E[E[lu(j)F Xj (Ts>k)\(X t y t=0 ]] 

k=0j=2s+l 



n 



p[w,LE s (x t )^-^ n {y„}* = | y = x 



= E E ^ E 

fc=0 j=2s+l 

There are fewer than 3rs terms in which k < s or j ^ [3s, r — 2s], each 
of which is bounded by n~ d . The sum of these terms thus contributes at 
most 3rsn~ d , which is of a lower order than E Cap r S (which is on the order 
of r 2 n~ d ). As ECap r S' = ar 2 n~ d , it thus suffices to show that for the 



(1 + o(l)) terms with k > s and j G [3s, r — 2s], we uniformly obtain 



lim E 

n— >oo 



[W,LE.<x t )££i, n = I = x ] 



LE<y t )g°ULE(Z u )f > )n{W v } 



I oo 

ri 



As before, since we are only running the walk for times on the order of r, the 
probability that there are no loops of length longer than n 7//4 is 1 — o(l). We 
then convert from a statement on the torus to one on the full lattice exactly 
as before. The convergence is uniform because the analogous conversions to 
l|53*|) rely only on the existence of these cutpoints. □ 
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9. Expanders, Hypercubes, and Proof of Theorem 11.31 

We stated in Section ^ that sequences of expander graphs satisfy the 
assumptions of Theorem 11.21 This is immediate from the fact that, for a 
sequence of expanders, there exist C > and A < 1 such that the bound 
|P D [-Xt = o] — |G n | _1 | < CA* holds for the entire sequence. 

We likewise claimed that © applies on the hypercubes TH\. To see this, 
consider two random walks {Aj} and {Y u } with different starting points, 
both run in continuous time with rate 1. The expected amount of time of 
intersection is the expected number of intersections for two discrete time 
walks. In continuous time, 

(54) F[X t = Y u ) < (l + e -2(*+«)/^ n ~ 1 (i _ e -nt+u)/n 

For t + u < n 1 / 4 , bound bv (1 - exp[-2(t + u)/n])/2 < 2(t + u)/n. As 
■ x/2 for < x < 1, we obtain the bound 



1 + exp 



2(t + u) 



, t + u 
< 1 < exp 

n 



t + u 



n 



n 

for n 1 / 4 < t + u < n/2, and then finally 

lj (l + e-^^y- 1 < (l + e-T 

for (t + u) > n/2 yields 

f [ r p[X t = Y u ]dtdu = o(l), 
Jo Jo 

which in turn says that the expected time of overlap of the two paths is o(l). 
As we ran the continuous time walks for time r 3> q, the expected number 
of intersections in the first q steps of the discrete walks is also o(l). 

Turning now to the proof of Theorem 11.31 note that Theorem 11.31 differs 
from Theorem 11.21 in two ways: first, we need to show that assumption (jEJ) 

allows us to omit the hypothesis that {x\, . . . , Xk} = {%i , ■ ■ ■ > are 
chosen uniformly, and second, we need to show that lim/3 n = 1. 

To show that we can choose {x\,... ,Xk} freely under assumption ©, 
let Tq = {p}, and for 1 < i < k, let {A]}^ be i.i.d., weighted random 
walks on G* n L with Aq = Xj. Let Tj = min{t > : X\ S and take 

^ = r i _ 1 ULE(A|)£ . 

Let s be as in Table ^ As the L°° distance between the distribution 
of X\ and the uniform is o(|G n | _1 ) for t > s, we only need to show that 
P[Tfc < s] = o(l). The main concern is intersections that might occur from 
Xk being close to {xi, . . . , x^-i}- After running for s steps, the X\ are close 
to uniform. Using the expected number of intersections between {A t fc }t <s 
and {Xl}t> s to bound the probability of such an intersection occurring, we 
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obtain 



(55) P[T fc < s] < fcmaxP {X l t } t<s Pi {X«} t<s + 

i<k L 



\Gn\ 



But assumption © is the fact that P [{X l t } t<s n {Xf} t<s / 0] = o(l), so 
as ETj < LjGnl 1 / 2 , equation (|55|) is exactly what we need. 

For the second part, assumption © implies that any point is a local 
cutpoint with probability 1 — o(l), so 7 = 1 — o(l). Likewise, © implies 
that the probability of a run of length r intersecting LE s (^4j) more than 
once, even conditioned on there being an intersection, is o(l). This means 
that the probability of an intersection is, up to a factor of 1 + o(l), the same 
as the expected number of intersections, and so we also have a = 1 — o(l). 
In particular, (3 n = 1 + o(l), which completes the proof of Theorem 11.31 

10. Further questions 

Although the main results of this paper give a good picture of the scaling 
limit of UST on many graphs, there are still a number of questions that 
remain. 

(1) Is the UST on the complete graph in some sense smaller than on any 
other vertex transitive graph? More precisely, if {G n } are vertex 
transitive, and x and y are uniformly chosen from G n , is there a 
constant C such that 



[d T (x,y) > \\G n \ 1 ' 2 } > exp 



21 



-4 



Benjamini and Kozma 5! asked an averaged form of this question, 
asking if Ed T (x,y) > C\G n \ 1 / 2 holds. 

(2) Theorems II 1 lllT~3"l only prove that the scaling limit of the UST is the 
Brownian CRT in the sense that the finite dimensional distributions 
converge. Does this convergence also hold in a stronger topology? 

(3) Our theorems do not apply to the torus Z 4 because r and IGVJ 1 / 2 are 
on the same order of magnitude in dimension 4. After taking into 
account a logarithmic correction factor, the scaling limit of LERW 
on Z 4 , however, is still Brownian motion 11. As discussed in 0, 
heuristics suggest that Kdrr(x, y) is on the order of n 2 log 1//6 n. If so, 
what is the limiting distribution of dq-(x,y)l What is the scaling 
limit of the UST on Z 4 ? 

(4) In this paper, we have focused on the intrinsic geometry of the UST, 
discussing distances in the UST. We can also ask about the existence 
of a scaling limit in the extrinsic geometry induced by embedding 
our graph in the torus of side length 1. The path of the LERW is 
asymptotically dense in this embedding, but lifting to the universal 
cover M d and dividing lengths by n d l i (the square root of the typical 
length of a path) , Theorem 11.11 suggests the following scaling limit 
for the lifted UST: first, the LERW from x to y lifts to a Brownian 
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motion on M. d run for a random amount of time T that is Rayleigh 
distributed. Lifting the partial spanning tree defined by fc-points in 
this way, we obtain an embedding in M. d of the first k steps of the 
Poisson line breaking constructing for the Brownian CRT, where an 
edge length of length £ in the CRT corresponds to a Brownian path 
run for time £. (This is a version of Le-Gall's Brownian snake.) 
However, establishing this picture requires further work. 
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